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'q^'- abstract 

We present the results of reconstruction analysis of the galaxy distribution in a 
Q ■ spherical region of radius 50h~^ Mpc centered on the Local Group, as mapped by 

Q ■ the IRAS Point Source Catalog Redshift Survey (PSCz). We reconstruct this galaxy 

^f^ . distribution using 15 different models for structure formation in the universe, each 

model consisting of a set of assumptions regarding the value of the cosmological mass 
density parameter 0^, and the amplitude and nature of the biasing between IRAS 
K* I galaxies and the underlying mass. For every model, we also reconstruct ten mock PSCz 

pg I catalogs derived from the outputs of numerical simulations that have the appropriate 

CN ' values of Qm and bias. We quantify the accuracy of a reconstruction using a variety 

of statistics and compare the accuracy of each reconstruction of the PSCz catalog to 
0> . the accuracy expected based on the mock catalog reconstructions of the corresponding 

model. We find that gravitational instability of Gaussian primordial mass density 






% 



fluctuations can account for the galaxy distribution in the PSCz catalog, at least for 
some plausible assumptions about the value of Qm and the biasing between IRAS 



O 

H I galaxies and mass. However, unbiased models in which IRAS galaxies trace mass fail 

^ ' to reconstruct the PSCz catalog accurately, both for Qrn = 0.4 and for 0^ = 1. Low 

Qm models in which IRAS galaxies are antibiased with respect to the mass distribution 

are the most successful in reconstructing the PSCz catalog. In particular, a model 

with Qjn = 0.4 and IRAS galaxies related to the mass distribution according to the 

predictions of a semi-analytic galaxy formation model is very successful in reproducing 

the properties of the PSCz galaxy distribution. 
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1. Introduction 

Understanding the formation and evolution of large scale structure (hereafter LSS) in the 
universe is one of the foremost problems in cosmology. In the standard approach to studying 
LSS, one starts with a model for the primordial mass density fluctuations, uses analytical 
approximations or numerical simulations to predict the ensemble averaged statistical properties of 
galaxy clustering, and compares them with those of the observed galaxy distribution, assuming 
that we observe fair sample of the universe. However, we cannot expect a simulation started 
from random initial conditions to reproduce the specific structures observed in a galaxy redshift 
catalog, even if the statistical properties of the galaxy clustering are correct. Reconstruction 
analysis is a complementary approach to the study of LSS in which one works backward from the 
observed galaxy distribution to the initial mass density fluctuations in the same region of space, 
then evolves these model initial conditions forward in time to the present day using an N-body 
code. The reconstruction method incorporates assumptions about the properties of primordial 
fluctuations, the values of cosmological parameters, and the "bias" between the galaxy and mass 
distributions. These assumptions can be tested by comparing in detail the evolved reconstruction 
to the original galaxy redshift data. A by-product of the reconstruction analysis is a detailed 
model for the origin and evolution of familiar, well studied structures in the local universe, such 
as the Great Attractor, the Perseus-Pisces supercluster, and the Sculptor Void. 

In this paper, we present the results of reconstruction analysis of the IRAS Point Source 
Catalog Redshift survey (hereafter PSCz, Saunders et al. 1995; Canavezes et al. 1998), using the 
"hybrid" reconstruction method described by Narayanan & Weinberg (1998, hereafter NW98). 
First, we create the galaxy density fleld smoothed with a Gaussian fllter of radius Rg = Ah~^ Mpc 
(where h = Hq/\QQ km s~^ Mpc" ), correct for the effects of bias and redshift-space distortions, 
and derive the smoothed initial mass density fleld by tracing the evolution of these density 
fluctuations backward in time. We then evolve these initial density fluctuations forward using 
an N-body code, assuming a value for 0.^-, and compare in detail the clustering properties of the 
reconstructed PSCz galaxy distribution to those of the input PSCz galaxy distribution, either 
assuming that galaxies trace mass or selecting galaxies from the N-body particle distribution using 
a biasing prescription. For our purposes, therefore, a model of structure formation consists of a 
value of r^m; a bias factor h that gives the ratio of rms galaxy and mass fluctuations on a scale 
of 8/i."^ Mpc, and an explicit biasing scheme that specifles how galaxies are to be selected from 
the large scale mass distribution. All of our models assume that structure grew by gravitational 
instability from Gaussian primordial fluctuations — these are the implicit assumptions of the 
reconstruction method itself. We reconstruct the PSCz catalog using 15 different models and 



quantify the accuracy of each reconstruction using a variety of clustering statistics. Even if the 
model assumptions are correct, we do not expect to reproduce the observed structure exactly, 
because we begin with imperfect data and because the reconstruction method cannot invert 
gravitational evolution in the strongly non-linear regime. For each statistic, we therefore rank 
the accuracy of the model's PSCz reconstruction with respect to the reconstructions of ten mock 
PSCz catalogs derived from the outputs of N-body simulations of the model under consideration. 
Finally, we use these ranks to evaluate the success of the PSCz reconstruction for each model, to 
constrain 17^, and to test models of bias between the mass and IRAS galaxy distributions in the 
real universe. 

The hybrid reconstruction technique (NW98) that we use for our PSCz analysis combines the 
complementary desirable features of the Gaussianization mapping method (Weinberg 1992), which 
assumes Gaussian primordial fluctuations and a monotonic relation between the smoothed initial 
mass density field and the smoothed final galaxy density field, and the dynamical reconstruction 
methods of Nusser &: Dekel (1992) and Gramann (1993a), which are based on the momentum and 
mass conservation equations, respectively, under the approximation that the comoving trajectories 
of mass particles are straight lines (the Zel'dovich [1970] approximation). In the hybrid method, 
we first recover the smoothed initial density field from the smoothed final mass density field 
using a modified form of the dynamical methods, then Gaussianize this recovered initial density 
field to robustly recover the initial fluctuations even in the non-linear regions (an approach also 
used by Kolatt et al. 1996) Jj For a reconstruction that incorporates biased galaxy formation, we 
precede the dynamical reconstruction step with a step that maps the smoothed galaxy density 
field monotonically to a smoothed mass density field with the theoretically expected (non-linear, 
non-Gaussian) PDF. The hybrid method is described and tested in detail in NW98, who show that 
it can reconstruct a galaxy redshift survey more accurately than either the Gaussianization method 
or the dynamical reconstruction methods alone. Comparison of the hybrid method to a variety 
of alternative reconstruction schemes, including the Path Interchange Zel'dovich Approximation 
(PIZA) method of Croft & Gaztaiiaga (1997), is given by Narayanan & Croft (1999). 

Earlier attempts to reconstruct observational data include the reconstruction of the Perseus- 
Pisces redshift survey of Giovanelli & Haynes (1989) by Weinberg (1989) using the Gaussianization 
technique and the reconstruction of the IRAS 1.2Jy redshift survey ( [Fisher et al. 1995 ) by Kolatt 



et al. (1996) using the dynamical scheme of Nusser & Dekel (1992). This dynamical scheme was 
also used by Nusser, Dekel & Yahil (1995) to recover the PDF of the initial density field from the 
IRAS 1.2Jy redshift survey. The primary requirements for a redshift survey to be suitable for 
reconstruction analysis are: (a) good sky coverage and depth, so that the gravitational influence 
of regions outside the survey boundaries is small, (b) dense sampling to reduce shot noise errors, 
and (c) a well understood selection function, to allow accurate construction of the observed galaxy 
density field. With respect to these criteria, PSCz is a substantial improvement on samples used 



^To "Gaussianize" a field, one applies a local, monotonic mapping that enforces a Gaussian 1-point PDF. 



in previous analyses, and it is the best all-sky redshift survey that exists today. The PSCz is a 
redshift survey of all galaxies in the IRAS Point Source Catalog whose flux at 60 fim is greater 
than 0.6Jy. It contains about 15,500 galaxies distributed over 84.1% of the sky, excluding only 
some regions of low Galactic latitude where the extinction in the V band as estimated from 
the IRAS 100 /um background is greater than 1.5 mag. (mainly the low Galactic latitude zone 
|6| < 5°), the Magellanic clouds, some odd patches contaminated by Galactic cirrus, and two strips 
in ecliptic longitude not surveyed by the IRAS satellite. 

The plan of this paper is as follows. In §2, we describe the hybrid reconstruction method 
used to reconstruct galaxy redshift surveys, outline the assumptions involved in the reconstruction 
analysis, and list all the steps involved in reconstructing the PSCz catalog in the order in which 
they are implemented. In §3, we describe the various models that we use to reconstruct the 
PSCz catalog, and our construction of the mock PSCz catalogs for each model from the outputs 
of N-body simulations. In §4, we illustrate the results of reconstruction analysis for 6 of our 
15 models, using a variety of statistics. We quantify the accuracy of the PSCz reconstruction 
of a model using a "Figure-of-Merit" for each statistic, and rank the PSCz reconstruction with 
respect to all the mock catalogs for that model. We summarize the results of reconstructing 
the PSCz catalog for the full set of 15 models in §5, and describe the criteria for classifying a 
model as Accepted, or Rejected, based on its rankings. We review our results and discuss their 
implications in §6, drawing general conclusions about the value of f^m, the amplitude of mass 
density fluctuations, the bias between IRAS galaxies and mass, and the viability of gravitational 
instability with Gaussian initial conditions as an explanation for the structure observed in the 
PSCz redshift survey. A brief overview of our results can be obtained from Figures 3, 7, and 13, 
and Table 2. 



2. Reconstruction Analysis 

We reconstruct the galaxy distribution in the PSCz catalog using the "hybrid" reconstruction 
method of NW98. We refer the reader to NW98 for a detailed discussion of the method, including 
its general motivation and tests of its accuracy on N-body simulations. Here we provide a summary 
of the assumptions made in the reconstruction (§2.1), justification of our choice of smoothing 
length and sample radius for PSCz (§2.2), and a step-by-step description of the reconstruction 
procedure as applied to PSCz (§2.3). 



2.1. Assumptions 

Reconstruction analysis of a galaxy redshift catalog incorporates a number of assumptions, at 
various stages. These include assumptions about the cosmological parameters, about the nature 
of the primordial mass density fluctuations, about the process of structure formation, and about 



the physics of galaxy formation. The assumptions in our analysis are: 

(1): Structure formed by gravitational instability. The reconstruction procedure traces the 
evolution of density fluctuations backward in time under the assumption that the LSS 
formed from the gravitational instability of small amplitude fluctuations in the primordial 
mass density field. This assumption is also implicit when the power-restored initial density 
field is evolved forward in time using a gravitational N-body code. 

(2): The primordial density fluctuations form a Gaussian random field, as predicted by simple 
inflationary models for the origin of these fluctuations ( Puth fc Pi 1982 ; Hawking 1982 



[Starobinsky 1982 ; Bardeen, Steinhardt, & Turner 1983). This assumption is the basis of the 



Gaussianization step of the reconstruction procedure. 

(3): The values of Qm and the bias factor b = crgg/crgm, where agm (^Sg) is the rms fluctuation 
of the mass (IRAS galaxy) distribution in spheres of radius 8h^^ Mpc. We vary these 
assumptions from one reconstruction to another. We use the value of 0^ in correcting for 
redshift-space distortions and in forward evolution of the reconstructed initial conditions. 
We use the value of asm = o'Sg/b to normalize the forward evolution simulations. Note that 
throughout the paper we use b to refer to the rms fluctuation bias on the 8h~^ Mpc scale. 

(4): The shape of the primordial power spectrum. In contrast to the amplitude crsm; changes to 
the power spectrum shape make little difference to our results, because the shape is used 
only to compute corrections to or extrapolations of the initial power spectrum recovered 
from the observational data. 

(5): The evolved galaxy density field is a monotonic function of the evolved mass density field, 
once both are smoothed over scales of a few h~^ Mpc. This assumption, together with the 
value of o"8m and the assumption of Gaussian initial conditions, allows us to recover the 
smoothed mass density field from the smoothed galaxy density field in preparation for the 
time-reversed dynamical evolution. 

(6): An explicit biasing scheme, i.e., a prescription for relating the underlying mass distribution 
to the observable galaxy distribution. This scheme does not influence the recovery of initial 
conditions, but it is needed to select galaxies from the N-body simulation evolved from these 
initial conditions, and hence to compare the reconstruction to the input data. Most of our 
biasing schemes have a single free parameter that controls the strength of the bias. We set 
the value of this parameter to obtain the desired bias factor b (assumption 3). 



2.2. Choice of smoothing length and sample radius 

Since the PSCz is a flux-limited survey, the number density of galaxies in the catalog 
decreases with distance from the observer. Consequently, the shot-noise in the PSCz galaxy 



distribution increases with distance. However, the Gaussianization procedure in the reconstruction 
analysis relies on the assumption that the rms amplitude of galaxy density fluctuations remains 
the same throughout the reconstruction volume and that the contribution to these fluctuations 
from shot-noise in the galaxy distribution is negligible. In order to ensure that the shot-noise 
remains small and does not increase with distance from the observer, we create a volume-limited 
PSCz sub-catalog in which the number density of galaxies remains constant throughout the 
reconstruction volume. 

Much of the diagnostic power of the reconstruction analysis stems from the fact that non-linear 
gravitational evolution transfers power from large scales to small scales ( Melott &: Shandarin 1990| ; 



Beacom et al. 1991 ; Little, Weinberg, & Park 1991; Soda fc Suto 1992 ; Bagla fc Padmanabhan 
1997). This power transfer erases the information about the initial mass fluctuations on scales 
below the non-linear scale (Fourier wavenumbers k > k^i). Consequently, a reconstruction method 
that recovers the initial fluctuations accurately up to the non-linear scale k = /cni can reproduce 
many features of the evolved structures even on smaller scales {k > k^i), though not, of course, the 
finer details of these features. Since the rms fluctuation of the IRAS galaxy distribution in spheres 



of radius 8h ^ Mpc is about 0.7 (Saunders, Rowan-Robinson &: Lawrence 1992; Fisher et al. 1994; 



Moore et al. 199'| ), we need to recover the initial density field accurately at least up to this scale 



in order to take advantage of the power-transfer phenomenon. We will therefore reconstruct the 
PSCz catalog using a Gaussian smoothing length of Rs = 4/i~^ Mpc, which corresponds to a 
top-hat smoothing scale of about 6.6/i^^ Mpc. 

We create the volume-limited sub-catalog by selecting all the galaxies in the PSCz located 
within a volume-limiting radius that are bright enough to be included in the survey even when they 
are located at this volume-limiting radius. We choose this volume-limiting radius, Ri, based on a 
compromise between two confiicting requirements. First, the reconstruction volume should be large 
enough that it contains many independent smoothing volumes. This criterion pushes us to choose 
a large value for Ri. Second, the shot-noise in the galaxy distribution of the volume-limited catalog 
should be small and remain constant with distance from the observer. This condition requires a 
uniformly high number density of galaxies, pushing us to adopt a smaller volume-limiting radius. 
Since we reconstruct the PSCz catalog using a Gaussian smoothing length of Rs = Ah~^ Mpc, we 
fix Ri so that the mean inter-galaxy separation aX Ri is d = Ug = V2Rs = 5.6/i~^ Mpc. We 
adopt this criterion d = V^Rg based on the rule of thumb suggested by Weinberg, Gott, Sz Melott 
(1987), to obtain the largest possible reconstruction volume while keeping shot-noise in the galaxy 
distribution small enough to have little effect on the smoothed galaxy density field. 

We compute the number density of galaxies as a function of the distance from the observer in 
the PSCz catalog using the maximum-likelihood method described by Springel & White (1998). 
We find that the number density of galaxies drops to 0.005 h^Mpc^^ = (5.6/i^^ Mpc)~^''^ at a 
distance of i?i = 50/i^^ Mpc from the observer. We then select all the galaxies in the PSCz 
catalog that are bright enough to be included in the survey even if they are placed at a distance of 
50h~^ Mpc from the Local Group. The galaxies selected in this manner are then included in the 



PSCz sub-catalog, which is thus volume-hmited to Ri = 50/i Mpc. The luminosities at 60 fj,m 

^0 



of the galaxies in this sub-catalog satisfy the condition log^^Q ( -^ 1 > 9.37 (for h = 1) 



2.3. Step-by-step description 

The steps involved in reconstructing a 50/i^^ Mpc, volume-limited subset of the PSCz catalog 
for a given set of model assumptions are as follows: 

Step 1: Create an all-sky galaxy distribution by "cloning" the galaxy distribution to fill in the 
regions excluded in the PSCz survey. The PSCz catalog does not include galaxies in regions 
of low Galactic latitude where there is substantial obscuration by dust. However, this region 
could be dynamically important, since the Perseus-Pisces supercluster and the Hydra-Centaurus 
supercluster are both located close to the Galactic plane and could even extend across it. Hence, 
we fill in the region with Galactic latitude |6cut| < 8°) using the cloning technique introduced by 
Lynden-Bell, Lahav & Burstein (1989; see also Yahil et al. 1991). We divide this region into 36 
angular bins of 10° in longitude and divide the redshift range in each angular bin into bins of 
1000 km s^^. In each longitude-redshift bin, we assign N{l,z) artificial galaxies, where N{l,z) is 
equal to a random Poisson deviate whose expectation value is equal to the average density of the 
corresponding longitude-redshift bins in the adjacent strips |6cut| < |^| < 2|6cut|, times the volume 
of the bin. If there is a real PSCz galaxy in any of these longitude-redshift bins, we include it in 
place of an artificial galaxy. We fill the masked regions at high Galactic latitudes with a random 
distribution of artificial galaxies having the observed mean density. The flux distribution of the 
artificial galaxies is identical to those of the real galaxies in the PSCz catalog. We tested (using 
mock PSCz catalogs) alternate methods of handling the mask region, including assigning galaxies 
at random locations within the mask region at the mean galaxy density, as well as ignoring all 
the galaxies in the mask region. We found that the cloning technique always leads to the most 
accurate reconstruction of the galaxy distribution. However, the mask region does not influence 
reconstruction analysis as much as it influences say, the analysis of the cosmological galaxy dipole 
(Rowan- Robinson et al. in preparation). 

Step 2: Select a volume-limited galaxy distribution from the flux-limited PSCz catalog so 
that the shot-noise in the volume-limited catalog is small and remains constant throughout 
the reconstruction volume. Based on the selection function of the PSCz survey, we choose 
the volume-limiting radius Ri = 50/i^^ Mpc, where the mean inter-galaxy separation is 
d = V2Rs = 5.6h-^ Mpc. 

Step 3: Compute the smoothed galaxy density field in redshift space. We create the PSCz galaxy 
density field in redshift space by cloud-in-cell (CIC) binning (Hockney & Eastwood 1981) the 
volume-limited galaxy distribution onto a 100^ cubical grid that represents 200/i~^ Mpc on a side. 
The Local Group observer is at the center of this cube, and the three sides of the cube are oriented 
along the axes of the Supergalactic coordinate system. Since the dynamical component of the 
hybrid reconstruction method traces the evolution of the gravitational potential backward in time, 
it is necessary to model the gravitational field in the regions beyond the boundaries in order to 



reconstruct the density field accurately near the edges of the volume-limited catalog. We therefore 
supplement the volume-limited density field with the density field in an annular region 20h~^ Mpc 
thick beyond the volume-limiting radius of 50h~^ Mpc. We form the density field in this annular 
region by weighting each galaxy by the inverse of the selection function of the flux-limited PSCz 
survey at the location of the galaxy. The full galaxy density field is therefore constructed from a 
volume-limited catalog in the region < i? < -Ri = 50h^^ Mpc and from a flux-limited catalog in 
the region Ri < R < R2 = 70h^^ Mpc. We fill the regions beyond R2 with uniform density equal 
to the mean density of the galaxy distribution in the volume-limited catalog, n = 0.005 h^Mpc"^. 
We smooth the galaxy density field using a Gaussian filter of radius Rs = 4/i~^ Mpc. We account 
for boundary effects in computing the smoothed density field Psmi^) by using the ratio smoothing 
method of Melott & Dominik (1993), 

/ M{r')p{r')W{r - r')d^r' 
^^'"^'■^ " jM{r')W{r-r')d^r' ' ^ ' 

where W{r) is the smoothing filter, and the mask array M(r) is set to 1 for pixels inside the survey 
region and to for pixels outside the survey region. The rms amplitude of the galaxy density field 
smoothed with a Gaussian filter of radius Rs = 4/i~^ Mpc is (T4g = 0.85. 

Step 4-' Monotonically map the galaxy density field onto a theoretically determined PDF of the 
underlying mass distribution. In an unbiased model, the galaxy density field is identical to the 
mass density field, so we skip this step entirely. In a biased model, we derive the mass density 
field using the PDF mapping procedure described in NW98. We first assume a value for the bias 
factor b and estimate the rms linear mass fluctuation using the equation 

asm = ^, (2) 

where asg and crgm are the rms fluctuations in 8/i~^Mpc spheres in the non-linear galaxy density 
field and the linear mass density field, respectively. We use an N-body code to evolve forward in 
time an ensemble of initial mass density fields, all drawn from the same assumed power spectrum, 
and all normalized to this value of cxsm- We then derive an ensemble-averaged PDF of the 
smoothed final mass density fields from the evolved mass distributions of these simulations. While 
reconstructing the PSCz catalog using a model that corresponds to this value of agm, we derive a 
smoothed final mass density field by monotonically mapping the smoothed PSCz galaxy density 
field to this average PDF. This step implicitly derives and corrects for the only monotonic local 
biasing relation that is simultaneously consistent with the observed galaxy PDF, the assumed 
shape of the power spectrum and value of b, and the assumption of Gaussian initial conditions. 
Step 5: Correct for the effects of redshift-space distortions. The peculiar velocities of galaxies 
distort the mapping of galaxy positions from real space to redshift space, making the line-of-sight 
a preferred direction in an otherwise isotropic universe. Since we need the real space mass density 
field to recover the initial mass density fluctuations, we need to correct for these redshift-space 
distortions. On small scales, the velocity dispersion of a cluster stretches it along the line-of-sight 



into a "Finger of God" feature that points directly toward the observer (e.g., de Lapparent, Geller 



& Huchra 1986), thereby reducing the amphtude of small-scale clustermg. To correct for this 
effect, we first identify the clusters in redshift space using a friends-of- friends algorithm that uses 
a transverse linking length of 0.6h~^ Mpc and a radial linking length of 500 km s~^ (Huchra & 
Peller 1982| ; [Nolthenius fc White 198^ ; Moore, Frenk, & White 1993; Gramann, Cen, & Gott 
1994). For each cluster, we then shift the radial locations of its member galaxies so that the 
resulting compressed cluster has a radial velocity dispersion of 100 kms~^, roughly the value 
expected from Hubble flow across its radial extent. On large scales, the distortions arise from 
coherent inflows into overdense regions and outflows from underdense regions ( ^argent Sz Turiier 



1977 ; Kaiser 1987 ). To remove these distortions, we apply the following iterative procedure, which 



is a modified version of the method suggested by Yahil et al. (1991) and Gramann et al. (1994). 
This method is described in detail in NW98, and we give only a brief outline here. After deriving 
the mass density field in step (4), we predict the velocity field using the second order perturbation 
theory relation ( Gramann 1993b[ ), 



v(r) = f{nm)H 



(r) + -VC,(r) 
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(3) 



where g(r) is the gravitational acceleration field computed from the equation V 
and Cg is the solution of the Poisson type equation 



g(rj 



V'C„ 



i=3 j=3 
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1=1 j=i+l 



d'cPg d\ 



d^ 



dxi 



dx] 



dxjdxi 



(4) 



Equation (^ requires that we assume a value of Q.m-, to compute the factor f{^rn) ~ ^m^ ( [Peebles 



1980). Finally, we correct the positions of galaxies so that their new positions are consistent with 



their Hubble fiow and the peculiar velocities at their new locations. We repeat these three steps 

until the corrections to the galaxy positions become negligible, which usually occurs within 3 

iterations. 

Step 6: Apply the dynamical reconstruction scheme to evolve the fiuctuations backward in time. 

We compute the gravitational potential from this smoothed mass density field using the Poisson 

equation, then evolve this gravitational potential backward in time using our modified version 

of the Gramann (1993) method. We use Poisson equation to derive the initial mass density 

fluctuations i.e., the fiuctuations that grow according to the predictions of linear theory. 

Step 7: Gaussianize this dynamically reconstructed initial mass density field. This step enforces 

a Gaussian PDF for the initial mass density fiuctuations and yields robust reconstructions even in 

the non-linear regions. 

Step 8: Restore power to the recovered initial density field. Non-linear gravitational evolution 

tends to suppress the small-scale power in the reconstructed density field, beyond the suppression 

due to the Gaussian smoothing alone. We correct for this effect using the "power restoration" 

procedure described in Weinberg (1992). Using an ensemble of numerical simulations, we compute 

a set of correction factors C{k) defined by 

■Priky^''' 



C{k) 



P^{k) 



(5) 
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where Pi{k) is the power spectrum of a simulation's smoothed initial conditions, and Pr{k) is the 
power spectrum of the density field recovered by the hybrid reconstruction method. We multiply 
each Fourier mode of the reconstructed density field by C{k) and also multiply by exp(fc^-R^/2) 
in order to remove the effect of the original Gaussian smoothing. Above some wavenumber 
kcorr ~ 7r/i2ni) where R^i is the scale on which the rms fluctuations are unity, non- linear evolution 
erases the information about the phases in the initial density field ( [Little et al. 1991 ; Ryden fc 



Gramann 1991 ) to the point that the hybrid method cannot recover it. For /ccorr < k < /cNyq, 
therefore, we add random phase Fourier modes with an assumed shape for the power spectrum, 
where /cNyq is the Nyquist frequency of the grid on which we define the density fields. We 
normalize this power spectrum by fitting it to the power spectrum of the recovered initial density 
field in the range of wavenumbers ki < k < k2, where ki and k2 are wavenumbers in the linear 
regime, with ki < k2 < ^corr- In the range of wavenumbers k2 < k < kcorr, multiplication by the 
large factor exp(A:^-R^/2) can distort the shape of the power spectrum, although this range of 
wavenumbers is only in the mildly non-linear regime. In this range of Fourier modes, therefore, we 
preserve the phases of the recovered initial density field but fix the amplitude of the modes to be 
that determined by the fitting procedure. In all of our simulations, /cNyq = 50k f, and we choose 
^corr = ISfcj, ki = Akf and k2 = 8k f, where kf = 27r/Lbox = 0.0314 hMpc~^ is the fundamental 
frequency of the simulation box. We assume that the shape of the power spectrum is governed by 
the parameter F, which is equal to ^mh in cold dark matter models with small baryon fraction 
and scale- invariant inflationary fluctuations (Efstathiou, Bond, & White, 1992). We do not add 
the small scale power using the technique of constrained realizations ( [Bertschinger 1987 ; Hoffman 



fc Ribak 1991 ; van de Weygaert fc Bertschinger 1996| ) because we find it does not lead to a more 



accurate reconstruction even for a dense sampling of the constraints (see NW98 for a more detailed 
discussion) . 

Step 9: Evolve the power-restored density field forward in time using an N-body code. We evolve 
the reconstructed initial mass distribution using a particle-mesh (PM) code, assuming the values 
of VLm and Q-\. This code is described and tested in Park (1990). We use 100^ particles and a 200^ 
force mesh in the PM simulations. We start the gravitational evolution from a redshift z = 23 
and follow it to z = in 46 equal incremental steps of the expansion scale factor a{t). We fix the 
amplitude of the linear mass density fluctuations to be crgm = <y?,g/b, where h is the bias factor. 
In the case of truly unbiased models (as opposed to biased models with h = 1.0), we instead fix 
the amplitude of the linear mass fluctuations by requiring that the non-linear rms amplitude of 
fluctuations in redshift space smoothed with a Gaussian filter of radius 4/i~^ Mpc {(J4_G,g) from the 
simulation match the observed value. 

Step 10: Compare the evolved distribution with the original galaxy distribution, either assuming 
that galaxies trace mass or using a local biasing model to select galaxies from the mass distribution. 
In biased reconstructions, we choose the free parameter controlling the strength of the bias by 
requiring that the rms fluctuation a^^c of the reconstructed, redshift-space galaxy density field, 
smoothed with a Gaussian filter of radius 4/i~^ Mpc, match that of the original galaxy density 
field. 
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Figure 1 illustrates the intermediate steps in a hybrid reconstruction analysis of the PSCz 
catalog. Panel (a) shows the redshift-space positions of all galaxies in the volume-limited PSCz 
catalog, in a slice 30h~^ Mpc thick centered on the Supergalactic plane (SGP). Panel (b) shows a 
slice through the SGP of the galaxy density field smoothed with a 4h^^ Mpc Gaussian filter. The 
smoothed initial density field recovered by the hybrid reconstruction method is shown in panel 
(c). The mass distribution obtained by evolving the power-restored initial mass density field using 
an N-body code is shown in panel (d). The reconstructed, smoothed, redshift-space galaxy density 
field, and the reconstructed, redshift-space galaxy distribution obtained by selecting galaxies from 
the evolved mass distribution using a power-law biasing model (see §3.1 below) are shown in 
panels (e) and (f), respectively. The reconstruction illustrated in Figure 1 assumes fi^ = 0.4, 
17a = 0.6, and b = 0.64. 



3. PSCz Reconstruction: Models and Mock Catalogs 

3.1. Model Assumptions 

We use 15 different models to reconstruct the PSCz catalog and to create mock catalogs 
for calibrating reconstruction errors. Each model consists of a set of assumptions regarding the 
values of 0^ and f^A; the shape of the linear mass power spectrum P{k) as characterized by the 
parameter F described in step (8) of §2.3, the bias factor b defined as b = bg = crgg/agm, and the 
functional form of the biasing relation between IRAS galaxies and the underlying dark matter 
distribution. As discussed in §2.3, these assumptions influence the reconstruction analysis in 
different ways. The value of fi^ is required when we correct the input data for redshift-space 
distortions and when we evolve the power-restored initial conditions forward in time. The shape 
(F) and amplitude (crgm = (^8g/b) of the mass power spectrum are used to calculate the correction 
factors C{k), and the shape is used to extrapolate the recovered initial power spectrum above 
the wavenumber k > fccorr- The value of the bias factor b is required when we map the biased 
galaxy density field to the numerically determined PDF of the underlying mass density field with 
rms fluctuation amplitude agm = cr^g/b. It is also required in the forward evolution step when we 
evolve the power-restored initial mass density field to match the rms fluctuation amplitude crsm- 
We need to assume an explicit biasing scheme to select galaxies from the evolved mass distribution. 
Most of our biasing schemes have one free parameter that we fix so that the rms fluctuation of 
the resulting galaxy distribution matches that of the input galaxy distribution, and a random 
sampling factor that we use to match the number density of galaxies in our volume-limited PSCz 
sample. 

Our models span a wide range of cosmological and galaxy formation parameters, varying with 
respect to the following properties: 

(1): r^m and Q/\_: Our assumptions for the geometry of the background universe include Einstein 
de-Sitter models {Qm = 1-0, ^a = 0), open models (17^ < 1-0, ^a = 0), and flat models 
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Fig. 1. — Steps in the reconstruction analysis of PSCz catalog, (a) The observed PSCz galaxy 

distribution, (b) the observed PSCz galaxy density field, (c) the initial mass density field linearly 
extrapolated to the present epoch, (d) the reconstructed mass distribution, (e) the reconstructed 
PSCz galaxy density field, and (/) the reconstructed PSCz galaxy distribution. The contour 
plots show the density field in the Supergalactic plane, smoothed with a Gaussian filter of radius 
Rs = Ah~^ Mpc. Solid contours correspond to regions above the mean density p and are in steps of 
0.1 in log(p/p), while dashed contours correspond to regions below the mean density and are in steps 
of 0.2 in (/?//?). The galaxy distributions show all the galaxies in redshift space in a slice 30/i~^Mpc 
thick centered on the Supergalactic plane, and volume limited to a radius Ri = 50/i^^ Mpc. This 
reconstruction corresponds to the model L4PLb0.64 listed in Table 1. 
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with a non-zero cosmological constant {^m < 1-0, ^m + ^A = 1-0). 

(2): Normalization and shape of the power spectrum: We normaUze the amphtude of the 

primordial mass density fluctuations (characterized by cr^rn) either to be consistent with the 
level of anisotropics in the cosmic microwave background measured by the COBE satellite 
(the COBE normalization, ^moot et al. 199^ ) or to produce the observed abundance of 
clusters at the present epoch (the cluster normalization, White, Efstathiou & Frenk 1993; 



Eke, Cole & Frenk 1996; Viana fc Liddle 1996 ). For all cluster-normalized models, we 



choose the values of ^m and asm so that agm^^ = 0.55 ( [White et al. 1993 ), for both 



open and flat universes. We refer the reader to Cole et al. (1997) and Cole et al. (1998, 
hereafter CHWF98) for more details of the COBE and cluster normalization procedures. 
We define the shape parameter of the transfer function in the linear mass power spectrum 
via the parameter F of Efstathiou et al. (1992); for CDM models with low baryon content, 
F « ^mh. Our power spectra include scale-invariant (n = 1) models with F values consistent 
with the clustering properties measured from several galaxy catalogs, viz., F = 0.15 — 0.3 
(iMaddox et al. 1990| ; [Efstathiou et al. 1992| ; [Vogeley et al. 1992[ ; [Peacock fc Dodds 1994 



Gaztahaga, Croft, & Dalton 1995; Maddox, Efstathiou, & Sutherland 1996; paztahaga & 



Baugh 1998 ; Tadros, Efstathiou, & Dalton 1998), and some models with larger F values. We 
also consider power spectra that are normalized to both the COBE and cluster constraints, 
by introducing a tilt in the spectral index of the power spectrum. Finally, two of our models 
are not normalized to COBE or clusters, although the rms fluctuation of the reconstructed 
galaxy distributions matches that of the IRAS galaxies. 

(3): Bias factor: We consider models in which IRAS galaxies trace mass (unbiased, b = 1.0), 
models in which IRAS galaxies are more strongly clustered than the mass (biased, b > 1.0), 
and models in which galaxies are more weakly clustered than the mass (antibiased, b < 1.0). 
We even consider one biasing model in which the galaxies do not trace mass but b = 1.0, i.e., 
the rms amplitude of fluctuations in the galaxy and mass distributions are identical at the 
scale of 8h^^ Mpc, but the galaxy density has a non-linear dependence on the mass density. 

(4): Biasing scheme: Our biasing relations cover a wide range of plausible functional forms, 
with the only constraint being that they remain monotonic. These include functions 
derived empirically from observations of different types of galaxies, functions predicted 
from semi-analytic models of galaxy formation, functions that flt the results of numerical 
studies of galaxy formation, and functions constructed ad-hoc. All of our biasing models 
are deterministic, and are "local" in the sense that the efficiency of galaxy formation 
is determined by the properties of the local environment, i.e., by the properties within 
approximately one correlation length of the location of the galaxy. We compute all the local 
properties of the mass distribution in a sphere of radius 4/i~^ Mpc centered on the galaxy. 

The specific biasing schemes that we use to select the IRAS galaxies from the evolved mass 
distributions of our reconstructions are as follows. 
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Power-law bias: In this simple biasing model, the IRAS galaxy density (pg) is a steadily 
increasing, power-law function of the local mass density, {pg/Pg) oc (pm/Pm)^- The probability for 
an N-body particle with local mass density pm to be selected as an IRAS galaxy is therefore 



P = A{pm/pr, 



\B-1 



(6) 



We choose the values of A and B to reproduce the required number density and the rms fluctuation 

of the resulting galaxy distribution, respectively. This biasing relation is similar to the one 

suggested by Cen & Ostriker (1993) based on hydrodynamic simulations incorporating physical 

models for galaxy formation ( |Cen fc Ostriker 1992 ), but it differs in that there is no quadratic 

term that saturates the biasing relation at high mass densities. 

Threshold bias: In this biasing scheme, galaxy formation is entirely suppressed below some 

threshold value of mass density, and IRAS galaxies form with equal efficiency per unit mass in 

all regions above the threshold. This biasing scheme was adopted in some of the early numerical 

investigations of CDM models (e.g., Melott fc Fry 1986| ), and it has been used extensively in 

theoretical modeling of voids and superclusters (e.g., Einasto et al. 1994 ). In the density-threshold 

bias model, the probability that a particle with local mass density pm is selected as an IRAS 

galaxy is 

A iipm>B, 

f) iipm<B. ^ ' 

We choose the threshold density B to match the required bias factor 6, and the probability A 

to reproduce the desired galaxy number density. We note that, since this model preferentially 

populates regions of higher mass density, it can only lead to a bias factor greater than unity, and 

hence cannot be used when an antibias (6 < 1.0) is required. 

Morphology- density bias: It has been known for a long time that early-type galaxies are 

preferentially found in dense environments, while late-type galaxies dominate in less massive 

groups and in the field ( Hubble 1936 ; Zwicky 1937 ; [Abell 195q ). There have been numerous 



P 



efforts to quantify this connection between morphology and environment (e.g.. Dressier 198C ; 
Postman &: Geller 1984; Lahav fc Saslaw 1992| ; Whitmore, Gilmore, &: Jones 1993), using a 
variety of clustering statistics including angular correlation functions, redshift-space correlation 
functions, and de-projected real-space correlation functions ( pavis fc Geller 1976| ; Giovanelli, 



Haynes & Chincarini 1986; Loveday et al. 1995; Hermit et al. 1996; Guzzo et al. 1997 



Willmer et al. 1998). We model the "bias" arising from this morphological segregation using 



the morphology-density relation proposed by Postman &; Geller (1984). Since the IRAS-selected 
galaxy catalogs preferentially include dusty, late- type spirals ( poifer et al. 1984 ; Meiksin &; Davis 



1986; Lawrence et al. 1986; Babul &: Postman 1990), we select all the spiral galaxies as IRAS 



galaxies. The morphology-density relation of Postman & Geller (1984) assigns morphological 
types to galaxies based on the densities at the locations of all the galaxies. Since we evolve the 
power-restored density field forward in time using a low resolution PM code and with a finite 
number of mass particles, the final densities computed over spheres centered on the galaxies and 
large enough to contain significant numbers of neighbors will be different from the densities at the 
exact locations of the galaxies. Therefore, we recast the morphology-density relation of Postman 
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& Geller (1984) in terms of the density computed in a sphere of radius 2/i~^ Mpc centered on the 
galaxy. We assign the galaxy a spiral (Sp), SO, or elliptical (E) morphological type, with relative 
probabilities Fgp, Fgg, and F^ that depend on the density computed within a sphere of radius 
2h~^ Mpc. For p < pp = lOp, the morphological fractions are Fgp = 0.7, i*so = 0.2, and Fe = 0.1. 
For pF < p < PC = Q X lO^p, the fractions are 

Fsp = 0.7 - 0.2a 

Fe = 0.1 + 0.1a (8) 

i^so = 1-Fsp-Fe 

a = Iogio(p/PF)/Iogio(pc/PF)- 

For p > PC, the morphological fractions saturate at Fsp = 0.5, F^o = 0.3, and Fe = 0.2. The ratio 
of rms fluctuations in 8h^^ Mpc spheres of the elliptical and spiral galaxy distributions selected 
in this manner is 1.3, consistent with the ratio of ~ 1.2 — 1.7 observed between optical and IRAS 



galaxy distributions (Lahav, Nemiroff, &: Piran 1990; Strauss et al. 1992; Saunders et al. 1992; 
Peacock fc Dodds 199-^ ; Willmer, da Costa, & Pellegrini 1998; paker et al. 19981 ). This biasing 
scheme has no free parameters, so the resulting bias factor is known a priori. 
Square-root Exponential bias: We construct a biasing scheme in which the IRAS galaxy density 
field is related to the mass density field by 

y = Ay/x ex.p(ax) , (9) 

where x = Pm/Pm ^^^ V — Pg/'Pg ^^e the mass and the IRAS galaxy overdensities, respectively. 
We choose the values of a and A to reproduce the required galaxy rms fluctuation and the mean 
number density, respectively. This biasing relation is a monotonically increasing function for all 
a > 0. We include this ad-hoc biasing scheme to test the ability of reconstruction analysis to 
distinguish between different biasing relations with the same bias factor. We use this biasing 
scheme in a model in which galaxies do not trace the mass, although b = 1.0. We note that neither 
the Power-law bias nor the Threshold bias can lead to b = 1.0, for any non-trivial values of the 
free parameters governing the strength of the bias. 

Semi-analytic bias: Except for the connection between power-law bias and the simulations of Cen 
& Ostriker (1993), the biasing models described so far are not based on theoretical models of the 
galaxy formation process. Rather, they include a variety of reasonable functional forms that could 
plausibly represent the results of some more complete theory of biased galaxy formation. We now 
consider a biasing scheme that is motivated by a physical theory of galaxy formation namely, the 
semi-analytic galaxy formation model of Benson et al. (1999; see also Cole et al. 1994, 1999). We 
parameterize this biasing scheme as follows. We consider the luminosities and morphologies of all 
the galaxies selected by Benson et al. (1999) from the mass distribution of the ACDM2 simulation 
of Jenkins et al. (1998, the VIRGO consortium). We select as IRAS galaxies all the galaxies whose 
ratio of bulge to total mass is less than 0.4. The rms fluctuation of the IRAS galaxy distribution 
selected in this manner (crgg), is about 10% smaller than that of the underlying mass distribution. 
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The solid points in Figure 2 show the mean relation between this "IRAS" galaxy density 
field and the underlying mass density field, after both fields are smoothed with a top-hat filter of 
radius i?th = 3/i~^ Mpc. The thick solid line shows an empirical fit to this mean relation using a 
smoothly varying double power-law of the form 

y = Ax" [C + 2;(°-^)/t] "^ , (10) 

where a = 2.9, /3 = 0.825, 7 = 0.4, C = 0.08, A = 1.1, and x = {I + 5m), y = (1 + 5g) are the 
mass and the IRAS galaxy overdensities, respectively, smoothed with a top-hat filter of radius 
i?th = 3/i~^ Mpc. During the reconstruction analysis, we use this semi-analytic biasing relation to 
select the IRAS galaxies from the evolved mass distribution. We found that the scatter around 
this mean relation is dominated by shot-noise and hence ignore it in our parameterization of the 
semi-analytic bias model. This bias model does not have any free parameters, so it results in 
a known bias factor. Note that this relation was derived for a ACDM model with ugm = 0.9, 
although we apply it here to an open model with slightly lower usm so that it matches the a%g of 
IRAS galaxies. 



3.2. Models 

The 15 different models that we analyze in this paper sample the range of interesting values 
of the parameters Vtm and a^rn (or, equivalently, the bias factor b). Thus, we analyze models with 
Qm = 0.2,0.3,0.4,0.5, and 1.0, while the values of o"8m range from 0.4 to 1.44. The parameters of 
all these models are listed in Table 1. In our model nomenclature, the first two symbols denote 
the geometry of the universe and the value of the cosmological mass density parameter. Thus, El 
represents an Einstein de-Sitter model with Vtm = 1-0, JIa = 0; Ox represents an open model with 
^rn = O.x, Oa = 0; and Lx represents a flat model with 0^, = 0.x and a cosmological constant 
Oa = 1 — f^m- The capital letters immediately following specify the nature of the local biasing 
relation between the IRAS galaxy distribution and the underlying mass distribution. The last 
series of numbers after the letter h corresponds to the bias factor of the model. For example, 
the model 04SQEbl.O specifies an open model with 0^ = 0.4 in which the biasing relation is a 
square-root exponential function and the bias factor is 1.0. 

We now briefly describe the features of all the 15 models. In §4, we will illustrate our analysis 
methods and results using 6 representative models, before giving summary results for the full suite 
of 15 models in §5. The 6 illustrative models are: 

ElUNbl.O — An Einstein de-Sitter universe, in which IRAS galaxies trace the mass distribution 
(unbiased), and hence b = 1.0. The shape of the power-spectrum is consistent with the observed 



clustering of galaxies, F = 0.25 (Peacock & Dodds 1994). The mass normalization agrees with 



the measured value of a^g « 0.7 for this biasing model, but it is above the cluster normalization 

constraint a^m = 0.55 for $7^ = 1- 

ElPLbl.8 — An Einstein de-Sitter universe, in which there is a power-law biasing relation between 
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Fig. 2. — The mean relation between the IRAS galaxy overdensity, Pg/'Pg = (1 + (5g), and the mass 
overdensity, Pm/'Pm — (1 + (^m), in the Semi-analytic galaxy formation model. Both the density 
fields are smoothed with a top-hat filter of radius -Rth = 3/i^^ Mpc. The solid points show the 
mean relation obtained from the simulation of Benson et al. (1999), while the thick solid line shows 
the empirical fit to it using a smoothly varying double power law, described by equation (|lO|). 
The dotted line shows the relation Pg/'Pg = Pm/limi corresponding to an unbiased model in which 
galaxies trace mass. 



18 



the IRAS galaxy and mass distributions. We choose the value of B so that b = 1.8 and the value 
of A so that Ug = 0.005/i^Mpc"'^. The mass fluctuation amplitude is below the level asm = 0.55 
implied by cluster normalization or by COBE normalization for its adopted O^ and shape of the 
power spectrum. It requires a large value of the bias factor (6 = 1.8) to match the rms fluctuation 
of the IRAS galaxies. This model has T = 0.25 and a tilted power spectrum with n = 0.803; it is 
similar to the E2(tilted) model of Cole et al. (1998), except that the rms fluctuation amplitude is 
(^8m = 0.40 instead of 0.55. 

O4MDb0.7 — An open universe with Q^ = 0.4 and Q\ = 0, in which the galaxy population as 
a whole traces the mass distribution. We select the IRAS galaxies using the morphology-density 
biasing relation. This model is cluster-normalized, and it requires the IRAS galaxies to be 
antibiased with respect to the mass, i.e., 6 < 1.0. 

O4SAb0.9 — An open universe with 0,rn = 0.4 and Oa = 0, in which the galaxies are selected 
from the mass distribution using the semi-analytic biasing model. Although this model is 
COBE-normalized by construction, it can simultaneously reproduce the observed mass function of 
clusters ( Pole et al. 1997 ). 



04SQEbl.O — An open universe with Om = 0.4 and I^a = 0, in which the IRAS galaxy density 
field is related to the mass density field by the square-root exponential biasing function. We 
choose the value of a = 0.041 so that b = 1.0 and the value of A so that Ug = 0.005/i^Mpc~^. 
In this model, the IRAS galaxies do not trace the mass distribution even though the bias factor 
b = 1.0. The parameters of this model are similar to those of O4SAb0.9, except that the biasing 
relation is very different. 

L3PLb0.62 — A flat universe with 0,^ = 0.3 and f^A = 0.7, in which the bias between galaxies and 
mass is described by the power-law bias model. This model is COBE-normalized, and it requires 
the IRAS galaxies to be antibiased with respect to the mass distribution. It can also reproduce 
the observed abundance of clusters at the present epoch ( Pole et al. 1997| ). 

We also reconstructed the PSCz catalog using another set of 9 models, which, together with 
the 6 models described above, extend our exploration of the Om, o'sm parameter space. These 9 
models are all either cluster-normalized, or COBE-normalized, or both. They are: 
ElPLbl.3 — An Einstein de-Sitter universe, in which the probability for a mass particle with 
local mass density pm to become a galaxy is given by the power-law biasing relation. We choose 
the value of B so that b = 1.3 and the value of A so that ng = 0.005/i^Mpc~'^. This model is both 
cluster-normalized and COBE-normalized, and it has a tilted power spectrum with n = 0.803 and 
r = 0.451. 

ElPLMDbl.3 — An Einstein de-Sitter universe, in which the galaxy population as a whole is 
biased using a power-law function. We choose the value of B so that agg ~ 1.0. We then use the 
morphology-density relation to select all the spiral galaxies as IRAS galaxies, so that agg ~ 0.7. 
The resulting bias factor of IRAS galaxies is 6 ~ 1.3. This model has the same mass power 
spectrum as ElPLbl.3. 

ElTHbl.3 — An Einstein de-Sitter universe, in which we select the galaxies from the mass 
distribution using the threshold biasing relation. We choose the threshold density B so that the 
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bias factor b = 1.3, and the probability A so that the mean galaxy density is Ug = O.OObh^ Mpc^ . 

This model has the same mass power spectrum as ElPLbl.3. 

O2PLb0.5 — An open universe with Q^ = 0.2 and Q\ = 0, in which the IRAS galaxies are selected 

from the mass distribution using the power-law biasing relation. This model is cluster-normalized, 

and it has the largest amplitude of mass fluctuations {agm = 1.44) among all of our 15 models. 

03PLbl.4 — An open universe with fi^ = 0.3 and 0,\ = 0, in which the IRAS galaxies are selected 

from the mass distribution using the power-law biasing relation. This is a COBE- normalized 

model. 

04UNbl.O — An open universe with Q^ = 0.4 and Oa = 0, in which the IRAS galaxies 

trace the mass distribution. Although this model is COBE-normalized by construction, it can 

simultaneously reproduce the observed mass function of galaxy clusters ( pole et al. 1997 ) . 



L2PLb0.77 — A flat universe with Qm = 0.2 and f^A = 0.8, in which there is a power-law biasing 
relation between the IRAS galaxies and the mass. This model is COBE-normalized, and it requires 
the IRAS galaxies to be antibiased with respect to the mass. 

L4PLb0.64 — A flat universe with Q^n = 0.4 and Qa = 0.6, in which there is a power-law biasing 
relation between the IRAS galaxies and the mass. This model is COBE-normalized, and it requires 
the IRAS galaxies to be antibiased with respect to the mass. This model can simultaneously 
reproduce the observed mass function of clusters ( Cole et al. 1997| ). 



L5PLb0.54: A flat universe with $7^ = 0.5 and 0,^ = 0.5, in which there is a power-law biasing 
relation between the IRAS galaxies and the mass. This model is also COBE-normalized, and it 
requires a strong antibias between the IRAS galaxies and the mass. 



3.3. Mock Catalogs 

If the reconstruction method were perfect, and structure in the universe really did form 
from gravitational instability of Gaussian initial conditions, then we would expect to reproduce 
exactly the galaxy distribution in the PSCz catalog, if we assumed the correct value of O^ 
and the correct biasing relation between IRAS galaxies and mass. However, the reconstruction 
method suffers from inaccuracies arising at various intermediate steps — inaccuracies in the bias 
mapping procedure, inaccuracies in correcting for the redshift-space distortions, inaccuracies in 
the dynamical recovery of the initial mass density fluctuations, and inaccuracies in the forward 
evolution step caused by poor modeling of the large scale tidal field and (on small scales) the finite 
numerical resolution. All these errors accumulate at various levels, with the result that we cannot 
expect a reconstruction to produce an exact match to the input data even if all of its assumptions 
are correct. It is therefore necessary to calibrate the magnitude of the errors intrinsic to the 
reconstruction method before we can derive any conclusions regarding the validity of the various 
assumptions entering the reconstruction procedure. 

We assess these errors by reconstructing a set of mock PSCz catalogs for each of the 15 
different models. For every model, we construct the mock PSCz catalogs from the outputs of 
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Table 1: Parameters of the 15 models used to reconstruct the PSCz survey. For each model, columns 
1 to 7 list the name of the model, the cosmological mass density parameter f^m, the cosmological 
constant Q\, the rms amplitude of the mass density fluctuations in 8h^^ Mpc spheres cgm) the 
cluster normalization parameter agm^'}^, the parameter /3iras = ^^/b where b = ctsiras / o'Sm is 
the bias parameter of IRAS galaxies, and the shape parameter T that specifies the shape of the 
linear mass transfer function (Efstathiou et al. 1992). A Y (N) in column 8 shows that the model 
satisfies (does not satisfy) the cluster constraint, while a similar notation in column 9 shows whether 
the model is COBE normalized. Column 10 lists the type of local biasing model used to select the 
IRAS galaxies from the underlying mass distribution. The final column specifies the name of the 
simulation in CIIWF98 from which we created the mock catalogs for each model. 
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COBE 


ElUNbl.O 


1.0 
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ElPLbl.3 
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0.0 


0.55 


0.55 


0.78 


0.45lt 


Y 


Y 
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N 
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N 
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Y 
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0.43 


0.61 
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Y 
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N 
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N 
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L3PLb0.62 
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0.55 
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L4PLb0.64 
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*Mock catalogs for all models were drawn from the simulations of CHWF98, except for the model ElPLbl.8, for which 
we created the mass distribution using a lower resolution PM simulation. See text for more details. 

'These models have a tilted inflationary power spectrum with n = 0.803, while all other models have a scale invariant 
(n — 1) inflationary power spectrum. 
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numerical simulations that have the appropriate values of Qm and bias. The geometry, the 
sky-coverage, the depth, and the selection function of the mock catalogs all mimic those of the 
original PSCz catalog. 

We construct the mock catalogs for 14 of the 15 models using the outputs of the N-body 
simulations of cold dark matter models performed by CHWF98. The CHWF98 simulations use 
a modified version of the AP3M code of Couchman (1991) to follow the gravitational evolution 
of 192^ particles in a periodic cubical box of side 345.6/i~^ Mpc, using a gravitational softening 
length of e = 90/i~^ kpc (for a Plummer force law), fixed in comoving coordinates. Further details 
of the simulations are in CHWF98. For the model ElPLbl.8, we created the mass distribution 
by evolving an initial density field with parameters similar to the E2 (tilted) model of CHWF98, 
except that asm = 0.4 instead of 0.55. We evolved 192^ particles on a 384^ force mesh using 
the PM code of Park (1990). Our goal here was to investigate an fi^ = 1-0 model with lower 
mass fluctuation amplitude than those considered by CHWF98, which is why we needed to run 
a new simulation. For the other 14 models, the 90h~^ kpc force resolution of the mock catalog 
simulation is much higher than the ~ lh~^ Mpc force resolution of the PM simulation used in 
the forward evolution step of the reconstruction procedure. Our calibration of systematic errors 
therefore includes the error caused by limited force resolution of the PM simulations. 

For every model (except ElPLbl.8), the last column in Table 1 lists the CHWF98 simulation 
from which we derive the mock catalogs. If the model involves bias, we start by selecting the 
galaxies from the mass distribution using the appropriate biasing algorithm. We then select 
"observers" from the galaxy distributions so that they satisfy the following observed properties of 
the Local Group: 

(1): The velocity of the Local Group observer should be in the range 550 km s~^ < Vlg < 

700 km s~^, consistent with the amplitude of the dipole anisotropy in the cosmic microwave 
background (Smoot et al. 1991). 



(2): The overdensity of galaxies in a spherical region of radius 5h Mpc centered on the Local 

Group observer should be in the range 1.0 < 1 + 6g{5h~^ Mpc) < 2.0 (Brown &: Peeble 
I 19871 ; [Hudson 199J ; jSchlegel et al. 1994|) . 



(3): The radial velocity dispersion in a sphere of radius 5h~^ Mpc around the Local Group 

observer should be less than 150 km s~^, consistent with the observations of a cold velocity 
field near the Local Group ([Sandage fc Tammann 1975 ; Sandage 1986| ; [Giraud 1986| ; Schlegel 



et al. 1994| ). We note that for all but one of the galaxy distributions (corresponding to the 



ElUNbl.O model), our Local Groups have local velocity dispersion smaller than 100 km s . 

(4): The Local Group particles for any pair of mock catalogs constructed from a simulation 

should be separated by at least 50/i~^ Mpc. This criterion ensures that the density fields in 
the mock PSCz catalogs centered on these observers are quite different from each other, at 
least within the volume-limiting radius Ri = 50h^^ Mpc. 
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We assign each particle in the galaxy distribution a redshift based on its real space distance 
and its radial peculiar velocity with respect to the Local Group observer particle. We assign 
luminosities to these galaxies consistent with the luminosity function of the IRAS galaxies in the 
PSCz catalog. We "observe" this galaxy distribution using the selection function of the PSCz 
survey. We reject all the galaxies in the angular regions not covered by the PSCz catalog, so that 
the sky coverage in the mock catalogs is identical to that of the true PSCz catalog. We create 10 
mock PSCz catalogs for each of the 15 models and reconstruct them in exactly the same manner 
as the PSCz catalog. 



4. PSCz Reconstruction: Illustrative Results 

We now describe the results of reconstructing the PSCz catalog and ten mock catalogs for the 
first 6 of the 15 models described in §3.2. Figure 3 shows a slice through the galaxy density fields 
of the true and the reconstructed PSCz catalogs. The density fields have been convolved with a 
Gaussian filter e~^' "^^^ with smoothing radius Rs = 4/i~^Mpc. The slices show the contours of 
the density field in the SGP. The galaxy density field traced by the galaxies in the PSCz catalog is 
shown in panel (a). Some of the prominent features include the Perseus-Pisces supercluster seen as 
the overdensity near the boundaries in the bottom right region, the Great Attractor region in the 
diagonally opposite direction near the top left corner, and the Local void in the bottom left region. 
We refer the reader to Branchini et al. (1999) for a detailed cosmographical description of the 
PSCz catalog. Panels (b) through (f) show the galaxy density fields reconstructed in the models 
ElUNbl.O, ElPLbl.8, O4MDb0.7, O4SAb0.9, and L3PLb0.62, respectively All the models can, at 
least qualitatively, reproduce the general features of the observed PSCz galaxy distribution. This 
success offers support to the hypothesis that structure formed from the gravitational instability 
of Gaussian primordial mass density fluctuations. We will see below that, although the various 
reconstructions resemble the observed PSCz galaxy density field in this visual representation, 
there are quantifiable differences between the accuracy of the reconstructions corresponding to 
different models. Thus, some models (like, for example, the models O4MDb0.7, O4SAb0.9 and 
L3PLb0.62) can reconstruct the PSCz catalog as well as can be expected based on the mock 
catalog reconstructions, while others (including the models ElUNbl.O and ElPLbl.8) fail in a 
systematic manner. 

Figure 4 shows the redshift-space locations of galaxies in the volume-limited PSCz catalog 
and its reconstructions. We plot the SGX and SGY coordinates of all the galaxies that lie in 
a slice 30/i~^Mpc thick centered on the SGP. The different panels show the true PSCz galaxy 
distribution and the various reconstructions, in the same format as Figure 3. 

One of the most obvious quantitative measurements of the success of a reconstruction is the 



23 



(a) PSCz 




(b) n=l, A = 0, b=1.0, UN 



-50 50 
(c) n=l, A=0, b=1.8. PL 




-40 
-60 

60 



-50 50 
(e) n = 0.4, A=0, b = 0.9, SA 




-50 



SGX [h-i Mpc] 




40 h 

20 



-20 

-40 

-60 

60 

40 

20 



-20 

-40 

-60 

60 

40 

20 



-20 

-40 

-60 



-50 50 
(d) n = 0.4, A=0, b = 0.7, MD 




-50 50 
(f) n = 0.3, A = 0.7. b = 0.62. PL 




-50 50 

SGX [h-i Mpc] 



Fig. 3. — Contours in the Supergalactic plane in the true and reconstructed PSCz density fields. 
The density fields are smoothed with a Gaussian filter of radius Rg = 4/i^^Mpc. Solid contours 
correspond to regions above the mean density p and are in steps of 0.1 in log(p/p), while dashed 
contours correspond to regions below the mean density and are in steps of 0.2 in (p/p). (a) The 
observed PSCz density field. Remaining panels show the density field reconstructed in the following 
models: (6) ElUNbl.O, (c) ElPLbl.8, (d) O4MDb0.7, (e) O4SAb0.9, and (f) L3PLb0.62. 
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Fig. 4. — Galaxy distributions in redshift space in the PSCz survey and its reconstructions. 
The panels show all the galaxies in a slice 30/i~^Mpc thick centered on the Supergalactic plane, 
and volume limited to a radius Ri = 50/i~^ Mpc. (a) The observed PSCz galaxy distribution. 
Remaining panels show the reconstructed galaxy distributions for the models listed in Fig. 1. 
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correlation coefficient r between the original and the reconstructed smoothed galaxy density fields, 

r.^^^, (U) 

(«?)' (SI)' 

where 6t and 5r are respectively the original and reconstructed smoothed galaxy density fields. 
Panels (a) through (f) of Figure 5 show the correlation coefficients for the models ElUNbl.O, 
ElPLbl.8, O4MDb0.7, O4SAb0.9, 04SQEbl.O, and L3PLb0.62, respectively. For every model, we 
assign ranks to the reconstructions of each of the 10 mock catalogs and to the reconstruction of 
the true PSCz catalog, in descending order of their values of r: the catalog whose reconstruction 
has the highest r value is assigned a rank of 0, the catalog whose reconstruction has the lowest 
r value is assigned a rank of 10, and so on in between. The solid line in each panel shows the 
values of r for the 10 mock catalog reconstructions of the model, in rank order. The horizontal 
dashed line shows the value of r for the PSCz reconstruction based on the model assumptions. We 
find that the absolute values of r tend to decrease for models with larger values of asm (smaller 
values of b) because the greater degree of non-linear gravitational evolution makes the recovery 
of initial conditions less accurate. The absolute value of r is therefore of little use for comparing 
the viability of different reconstruction models. We focus instead on the value of r relative to 
the values expected given the model assumptions, and since the PSCz reconstructions for all six 
of these models have a rank of seven or less (and the models therefore reproduce PSCz better 
than they reproduce at least three of their own mock catalogs), we conclude that all of them are 
acceptable by this particular measure. 

The correlation coefficient quantifies the success of the reconstruction in matching the 
observed galaxy density field at a particular scale, Rs = 4:h~^ Mpc. In order to probe a range of 
scales, Figure 6 shows the distribution of the Fourier difference statistic 



where the subscripts t and r refer to the true and reconstructed density fields respectively, and 
(5(k) represents the complex Fourier component of the density field. The summation is over all 
the waves with |k| in the interval (k — i,k]. This statistic measures the difference in both the 
moduli and the phases of the Fourier components of the true and reconstructed density fields, and 
it is independent of any smoothing of the density fields. It was ffist used by Little et al. (1991) 
to demonstrate the effects of power-transfer from large scales to small scales during non-linear 
gravitational evolution. When the complex amplitudes of the Fourier components of the true 
and reconstructed density fields are identical, D{k) = 0, while for two fields with uncorrelated 
phases, the average value of D{k) = 1. Figure 6 shows the value of D{k) averaged over the range 
of wavenumbers kguw < k < kg, where /csurv = 27r/(2 x Ri) = 0.0628 ZiMpc"^ is the wavenumber 
corresponding to the size of the reconstruction volume, and kg = 27r/(2 x 8) = 0.3927 /iMpc~^ is 
the wavenumber corresponding to the length scale of 8h^^ Mpc, approximately equal to the scale 
of non-linearity in the PSCz galaxy distribution. The different panels correspond to the six models 
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Fig. 5. — The correlation between the true and reconstructed smoothed galaxy density fields. The 
solid line in all the panels shows the distribution of the correlation coefficient (eq. ^) between 
the true and reconstructed smoothed galaxy density fields for the ten independent mock catalogs, 
while the dashed line shows the same quantity for the PSCz reconstruction. The different panels 
show the galaxy distributions reconstructed with the following assumptions: (a) ElUNbl.O, (6) 
ElPLbl.8, (c) O4MDb0.7, (d) O4SAb0.9, (e) 04SQEbl.O, and (/) L3PLb0.62. Note that the range 
of the vertical axis varies from one panel to another. 
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in the same format as Figure 5. The sohd hne in each panel shows the distribution of D{k) in the 
ten mock catalog reconstructions of that model. The dashed line shows the value of D(k) in the 
PSCz reconstruction. We rank the mock catalog reconstructions and the PSCz reconstruction, in 
ascending order of their values of D{k). We find that the PSCz reconstruction has a high rank 
(of 9) in the model ElUNbl.O, and has smaller ranks in all the other models. Hence, the model 
ElUNbl.O fails (though only at the 80% confidence level) to reproduce the PSCz density field as 
measured by this statistic, while the other five models yield acceptable reconstructions. 

Figure 7 shows the PDF of the true and reconstructed galaxy density fields for the six 
models. We compute the PDF of a density field after smoothing it with a Gaussian filter of radius 
Rs = 4/i^^ Mpc. In every panel, the crosses and the thin solid line show the PDFs of the true and 
reconstructed galaxy density fields for a typical mock catalog — the one with rank 5 according 
to the Figure-of-Merit (FOM) for this statistic, defined as the maximum value of the absolute 
difference between the cumulative distributions Ci(i^) and Cri^) of the true and reconstructed 
galaxy density fields, 

FOMpDF = max \Ct{y) - Cr{y)\. (13) 

This is the FOM that would be used in a Kolmogorov-Smirnov comparison of the PDFs, and we 
find that it gives similar results in terms of ranks to a FOM based on absolute differences of the 
differential PDFs. The open circles and the thick solid line show the true and reconstructed PDFs 
for the PSCz reconstruction, offset vertically by 0.2 for the sake of clarity. 

For every model, we rank the mock catalog reconstructions and the PSCz reconstruction, in 
increasing order of the FOM of the statistic. If the reconstruction of PSCz based on the model 
assumptions is worse than expected from the mock catalog tests, then the PSCz reconstruction will 
have a high rank. A low PSCz rank, conversely, implies a reconstruction that is successful given 
the expectations from the mock catalog tests. Visual comparison between the PDF recoveries for 
PSCz and for the rank-5 mock catalogs in Figure 7 suggests that the PSCz reconstructions for 
the models ElUNbl.O, ElPLbl.8, O4MDb0.7, and 04SQEbl.O should have high ranks, while the 
PSCz reconstructions for the models O4SAb0.9 and L3PLb0.62 should have low ranks. This is 
indeed the case, as can be verified by the PSCz ranks listed in each panel. 

We show the ranks for all other statistics in Figures 8 through 12, in the same format as in 
Figure 7 for the PDF statistic. If the PSCz catalog has a rank of 5 for any of the statistics, we 
show the results for the mock catalog ranked 6 according to that statistic. We will use the ranks 
for all the statistics as the basis for a more systematic evaluation of models in §5. 

Figure 8 shows the distribution of galaxy counts in spheres of radius 8/i^^ Mpc, in the true 
and reconstructed galaxy distributions for the six models. We compute this distribution by 
placing 50, 000 spherical cells at random locations within the reconstruction volume and counting 
the number of galaxies within each cell. If a cell lies close to the boundary of the survey region, 
we include it in the distribution only if at least 90% of its volume lies within the survey region. 
The crosses and the thin solid line show the distributions of counts in the true and reconstructed 
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Fig. 6. — The Fourier difference statistic D{k) between the true and the reconstructed galaxy 
density fields (eq. |l2|). The solid line in all the panels shows the distribution of the value of D{k) 
averaged over the range of wavenumbers kgurv < A; < /cg in the ten independent mock catalogs, 
while the dashed line shows the same quantity for the PSCz reconstruction. The different panels 
correspond to the galaxy distributions reconstructed using the various models listed in Fig. 4. Note 
that the axis range varies from one panel to another. 
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Fig. 7. — PDF of the true and reconstructed galaxy density fields. All the density fields are 
smoothed with a Gaussian filter of radius Rg = 4/i~^Mpc. The crosses and the thin solid line 
show the PDFs of the true and reconstructed smoothed density fields of a mock catalog, the one 
with rank 5 according to the FOM for this statistic defined in equation (^). The open circles 
and the thick solid line show the same quantities for the PSCz catalog, and are vertically offset 
by 0.2 for clarity. The different panels correspond to the galaxy distributions reconstructed using 
the various models listed in Fig. 4. A successful model should reproduce the PSCz (i.e., show 
agreement between the circles and thick line) about as well as it reproduces its own mock catalogs 
(as illustrated for a typical case by the crosses and thin line). Each panel lists the rank of the PSCz 
reconstruction based on the model assumptions, relative to the reconstructions of the model mock 
catalogs, according to the FOM for this statistic. 
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galaxy distributions of the mock catalog with rank 5. The open circles and the thick solid line 
show the same quantities for the PSCz catalog, and are vertically offset by 0.05 for clarity. We 
define the FOM for this statistic as 

N=oo 

FOMcouNTS= E \PtiN) - PriN)\, (14) 

N=l 

where Pt{N) and Pr{N) are the distributions of the counts in cells in the true and reconstructed 
galaxy distributions. From visual inspection of Figure 8, we would expect the models ElUNbl.O, 
ElPLbl.8, and 04SQEbl.O to have high ranks and the models O4MDb0.7, O4SAb0.9, and 
L3PLb0.62 to have low ranks, as is indeed verified by the ranks of the PSCz reconstruction listed 
in different panels. We also computed this distribution using spherical cells of radius 3h~^ Mpc, 
but do not show the corresponding figure. Although the distribution of galaxy counts is a 
measure similar to the PDF statistic shown in Figure 7, here we are using different smoothing 
filters (top-hat instead of Gaussian) and smoothing lengths (3/i~^ Mpc and 8h^^ Mpc instead of 
4/i-i Mpc). 

Figure 9 shows the void probability function (VPF) in the true and reconstructed galaxy 
distributions for the six models. Like the PDF and the count distributions, this statistic is 
sensitive to higher-order correlations in the density field ( White 1979| ; Balian fc Schaeffer 1989 ; 



Sheth 1996 ), and it can distinguish between biased and unbiased galaxy formation models ( [Little 



fc Weinberg 1994 ). The probability Po{R), that a randomly placed sphere of radius R is devoid of 
galaxies is a subset of the more general count distribution statistic Pn{R), but here we examine 
Pq at a range of R instead of P/v at fixed R = 8h^^ Mpc, as in Figure 8. When computing the 
VPF, we require that at least 90% of the spherical cell's volume lie within the survey region. We 
define the FOM for this statistic as 

R=oo 

FOMvPF = E \PotiR) - Por{R)\, (15) 

R=0 

where PotiR) and Por(-R) are the VPFs of the true and reconstructed galaxy distributions, and the 
sum extends over discrete bins in R. By visual comparison with the mock catalog reconstructions, 
we expect the models ElUNbl.O, ElPLbl.8, O4MDb0.7, and 04SQEbl.O to have high ranks. This 
expectation is confirmed by the ranks of the PSCz reconstruction listed in each panel. We also 
computed the underdensity probability function (UPF) introduced by Weinberg & Cole (1992) 
and found that the different models have similar ranks for the UPF as for the VPF. The UPF 
requires that a sphere be more than 80% below the mean density rather than completely empty. 

Figure 10 shows the distribution of distances to nearest neighbors in the true and reconstructed 
catalogs. If computed in three dimensions using galaxy redshift distances, this statistic would show 
a spurious peak at neighbor separations corresponding to the velocity dispersions of typical galaxy 
groups. Therefore, we instead estimate the nearest-neighbor distribution from the redshift-space 
galaxy distributions using the method suggested by Weinberg &; Cole (1992). For every galaxy 
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Fig. 8. — Distributions of galaxy counts in spheres of radius 8h~^ Mpc in the true and reconstructed 
galaxy distributions. The crosses and the thin sohd hue show the counts in cehs of the true and 
reconstructed galaxy distributions of the mock catalog ranked 5 according to the FOM for this 
statistic (eq. |l^. The open circles and the thick solid line show the same quantities for the PSCz 
catalog, and are vertically offset by 0.05 for clarity. The different panels correspond to the galaxy 
distributions reconstructed using the various models listed in Fig. 4. 
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Fig. 9. — Void probability functions (VPF) of the true and reconstructed galaxy distributions. The 
crosses and the thin solid line show the VPFs of the true and reconstructed galaxy distributions 
of the mock catalog ranked 5 according to the FOM for this statistic (eq. p^ ). The open circles 
and the thick solid line show the same quantities for the PSCz catalog, and are vertically offset by 
0.5 for clarity. The different panels correspond to the galaxy distributions reconstructed using the 
various models listed in Fig. 4. 
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at a redshift z, we consider all the galaxies that lie within a redshift range Av < 1000 kms~^ to 
be its potential nearest neighbor. Of these candidate neighbors, we then choose the galaxy that 
lies closest to this galaxy in the transverse direction, and we compute the distribution of this 
transverse separation Rt divided by the mean inter-galaxy separation d (i.e., Xn = Rt/d). This 
approach biases the estimated neighbor distance, but the bias is the same for the PSCz data and 
the reconstructions. We define the FOM for this statistic as 

x„=l 

FOMnNBR= E \Pt{Xn) - Pr{Xn)\, (16) 

a;„=0 

where Pt{xn) and Pr{xn) are the nearest-neighbor distributions of the true and reconstructed 
galaxy distributions. From the ranks of the PSCz reconstruction in different panels, we find that 
the models ElPLbl.8 and 04SQEbl.O have high ranks, while the other models have low ranks. 

Figure 11 shows the redshift-space correlation functions S,{s) for the true and reconstructed 
catalogs. We compute the correlation functions using the estimator of Hamilton (1993), 

^. ^ NddNrr 

where Nuu, N^r, and Njir are the number of galaxy-galaxy, galaxy-random, and random-random 
pairs with a redshift-space separation s. We use a random catalog that has the same geometry 
and selection function as the PSCz catalog and contains 50,000 points distributed randomly 
within the survey volume. We consider only those galaxy pairs that subtend an angle smaller 
than amax = 60° at the observer so that the lines of sight to both the galaxies in the pair are 
approximately parallel. We fit the correlation function in the region l/i~^ Mpc < s < 15h^^ Mpc 
with a power-law of the form ^(s) = ( 7- ) , where sq is the redshift-space correlation length and 7 
is the index of the power-law. We define the FOM as 

F0M^ = |7t-7^|, (18) 

where 7t and jr are the slopes of the true and reconstructed, redshift-space correlation functions. 
We find that the model ElUNbl.O has a high rank, while the other five models have low ranks. 
The failure of the model ElUNbl.O is the one expected if we reconstruct a low fi^ universe using 
a high value of flm'- the high velocity dispersion in an Q^ = 1 model leads to excessive suppression 
of ^{s) on small scales, while the large amplitude of the coherent bulk motions ( Kaiser 1987| ) 



boosts it on large scales. Thus, the reconstructed ^(s) has a shallower slope compared to the true 
^(s). We investigated a number of alternative FOM definitions, but we found (using the mock 
catalogs rather than the PSCz data set itself) that the difference in slopes was the most effective 
measure for picking out the characteristic signature of excessive redshift space distortions. 

The peculiar velocities of galaxies affect the redshift space clustering on both small and large 
scales, as discussed in §2.2. However, the real to redshift space mapping does not affect the 
galaxy clustering perpendicular to the line-of-sight. Figure 12 shows the projected correlation 
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Fig. 10. — Distribution of distances to nearest neighbors in the true and reconstructed galaxy 
distributions. We compute this distribution from the redshift-space locations of galaxies using 
the method described in the text. The thin dashed and solid lines show the nearest neighbor 
distributions of the true and reconstructed galaxy distributions of the mock catalog ranked 5 
according to the FOM for this statistic (eq. |l6|). The thick dashed and solid lines show the same 
quantities for the PSCz catalog, and are vertically offset by 0.7 for clarity. The different panels 
correspond to the galaxy distributions reconstructed using the various models listed in Fig. 4. 
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Fig. 11. — The redshift-space correlation function, ^(s), of the true and reconstructed galaxy 
distributions. The thin dashed and solid lines show ^(s) of the true and reconstructed galaxy 
distributions of the mock catalog ranked 5 according to the FOM for this statistic (eq. ^). The 
thick dashed and solid lines show the same quantities for the PSCz catalog, and are vertically offset 
by 0.5 for clarity. The different panels correspond to the galaxy distributions reconstructed using 
the various models listed in Fig. 4. 
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function w{rp) (Davis & Peebles 1983; Fisher et al. 199^ ) of the true and reconstructed galaxy 



distributions, computed using an estimator similar to the one defined in equation (|T^. Here, the 
transverse separation r^ is defined by the relation r^ = s^ — vr^, where s is the true separation in 
redshift-space and vr is the separation along the line-of-sight between the two galaxies in a pair. 
We fit a power-law to this function in the range lh~^ Mpc < r^ < 15/i^^ Mpc, and define a FOM 



similar to that defined by equation (18). We find that the model 04SQEbl.O has a high rank. 



while the other five models have low ranks. 



5. Evaluation of Models 

We also reconstructed the PSCz catalog and the corresponding mock catalogs for the 
remaining set of nine models described briefly in §3.2. For all 15 models, we measured all the 
statistics described in the last section. We then ranked these models using the FOM corresponding 
to each statistic, in the manner described in §4. Table 2 lists the ranks of the PSCz reconstruction 
with respect to the mock catalogs for our full suite of 15 models. 

Table 2 is the complete quantitative summary of the results of our reconstruction analysis of 
the PSCz catalog. A low rank for any statistic indicates that the model reproduces that statistical 
property of the PSCz catalog as well as, or better than, it reproduces that property for most of 
the mock catalogs corresponding to that model. On the other hand, a high rank (close to 10) 
for any statistic indicates that the model does not reproduce that property of the PSCz catalog 
as accurately as would be expected (based on the mock catalogs) if the model were a correct 
representation of the real universe. Computational practicality limits us to ten mock catalogs for 
each of our models, so even if the PSCz reconstruction has a rank of 10 for a particular statistic, 
we can only conclude that the model fails that statistical test at the ~ 90% confidence level. If we 
were to reconstruct 100 mock catalogs for that model, we would expect the PSCz reconstruction 
to be worse than at least ~ 90 of the mock catalogs (unless we happened to be unusually lucky in 
the ten that we did reconstruct), but we do not know whether it would be worse than 95, or 99, 
or all 100, since we have not been able to probe the tails of the reconstruction error distribution. 

Two issues complicate the interpretation of Table 2. First is the fact that we have considered 
many different statistical tests and therefore given the PSCz reconstruction many "chances to 
fail". As a result, a single rank of 10 does not necessarily imply a failure of the model; if the nine 
statistics were entirely independent of each other (which they are not), we would expect a typical 
mock catalog to have one rank of 10, and a significant fraction to have more than one rank of 10. 
In order not to be misled, we must compare the ranks of the PSCz reconstruction with the ranks 
of the mock catalog reconstructions even when we draw general inferences from Table 2, as we do 
below. 

The second complication is that the statistical measures are not all independent of each other, 
since the clustering properties that they quantify are in some cases closely related. Fortunately, 
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Fig. 12. — The projected correlation function, ■w{rp), of the true and reconstructed galaxy 
distributions. The thin dashed and solid lines show w{rp) of the true and reconstructed galaxy 
distributions of the mock catalog ranked 5 according to the FOIVI for this statistic (eq. ^). The 
thick dashed and solid lines show the same quantities for the PSCz catalog, and are vertically offset 
by 0.5 for clarity. The different panels correspond to the galaxy distributions reconstructed using 
the various models listed in Fig. 4. 



38 



we can use the mock catalogs themselves to understand the correlations between the different 
statistics. Using all 150 mock catalogs, we computed the covariance matrix of the ranks of the 
nine statistics, and we also computed the distribution of mock catalog ranks for each statistic 
conditioned on the catalog having a rank of 10 for one of the other statistics. Both analyses 
led to the same conclusion: the nine statistics fall into five groups, and ranks within each group 
are correlated but ranks in one group are essentially uncorrelated with ranks in another group. 
The five groups are: (1) the correlation coefficient (r) and the Fourier difference statistic D{k), 
(2) the PDF of the smoothed galaxy density field, (3) the counts in spheres of radii 3/i~^ Mpc 
and 8h~^ Mpc and the void probability function, (4) the nearest neighbor distribution, and (5) 
the two correlation functions ^(s) and w{rp). The statistics in the first and third groups are 
strongly correlated amongst themselves, while the statistics in the fifth group (the two correlation 
functions) are only moderately correlated. 

As an overall quantitative measure of the success of a PSCz reconstruction relative to the 
expectation based on mock catalogs, we list the weighted mean rank of the PSCz reconstruction 
in column 11 of Table 2. We weight the rank of each statistic inversely by the number of statistics 
in its correlated group, so each of the five independent groups contributes equally to this mean 
rank. The mean weighted rank of mock catalogs computed in this manner is 5.0, so a PSCz 
reconstruction with mean rank greater than 5.0 is less accurate than a reconstruction of a typical 
mock catalog, and vice versa. 

Two of the models listed in Table 2 fail the reconstruction test unambiguously. The PSCz 
reconstruction for model 04UNbl.O has a rank of 10 in each of three independent groups of 
statistics, the PDF, the counts/VPF, and the nearest neighbor distribution. The worst of the ten 
mock catalog reconstructions of this model has two independent ranks of 10, while the next worst 
has one rank of 10 and two ranks of 9. The 04SQEbl.O model fails even more clearly. The PSCz 
reconstruction of this model has a rank of 10 in four of the five independent groups of statistics, 
while the worst mock catalog reconstruction for this model has one rank of 10 and one rank of 9. 
For both models, the weighted mean rank of the PSCz reconstruction is higher than that of any of 
the model's ten mock catalog reconstructions. Remarkably, the O4SAb0.9 model, which has nearly 
the same cosmological parameters as these two failed models but a different form of the biasing 
relation, produces one of the most successful PSCz reconstructions. We return to this point in §6. 

The other model that fares especially poorly in Table 2 is ElUNbl.O, which has two 
independent ranks of 10 and a rank of 9 in third independent group. One of the mock catalog 
reconstructions of this model actually performs worse, with three independent ranks of 10 and a 
fourth independent rank of 9, and this mock catalog has a weighted mean rank of 7.7, identical 
to the PSCz mean rank of 7.7. In a purely statistical sense, therefore, we cannot rule out this 
model as clearly as we can rule out 04UNbl.O and 04SQEbl.O. However, as already noted in 
our discussion of Figure 11a, the ElUNbl.O reconstruction of PSCz fails in exactly the manner 
expected if we reconstruct a low Vim universe with an ilm = 1 model of similar a^m'- the high 
peculiar velocities in the O^ = 1 reconstruction suppress ^(s) on small scales and boost it on large 
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scales, making the reconstructed slope of ^(s) too shallow. The PSCz reconstruction has a rank 
of 10 for ^(s) but only 6 for w{rp), so it is clear that this failure is caused by the reconstruction's 
excessive peculiar velocities, not by a problem in the real space clustering. None of the mock 
catalog reconstructions of this model, including the one that has more high ranks than the PSCz 
reconstruction, fails in this characteristic manner. 

Based on these considerations, we classify the models 04UNbl.O, 04SQEbl.O and ElUNbl.O 
as Rejected according to our analysis, and we indicate this classification by an R in column 12 of 
Table 2. These three PSCz reconstructions have the highest mean ranks among the 15 models, 7.8, 
7.8 and 7.7, respectively. We classify the remaining 12 models as Accepted (indicated by an A in 
column 12), since in each of case there is at least one of the model's mock catalogs that has more 
independent ranks of 9 or 10 than the model's PSCz reconstruction. However, within this class 
of Accepted models, there is a wide range in the relative accuracy of the PSCz reconstruction. 
The models ElPLbl.3, ElPLbl.8, and L2PLb0.77 are all Accepted on the basis of a single mock 
catalog that is reconstructed worse than the PSCz catalog, while the models ElPLMDbl.3 and 
03PLbl.4 are Accepted on the basis of two mock catalogs that are reconstructed worse than the 
PSCz catalog. These five models have the highest mean ranks among the 12 Accepted models. 
In each of the remaining seven Accepted models, the PSCz is reconstructed better than at least 
three mock catalogs, and these models have correspondingly smaller average ranks for their PSCz 
reconstructions . 



6. Discussion 

We have reconstructed the IRAS galaxy distribution in our cosmological neighborhood, within 
a spherical region of radius 50/i^^ Mpc centered on the Local Group. We have tested 15 different 
models, each consisting of a set of assumptions about the values of fi^ and I^A) the value of the 
bias factor b, and the nature of the biasing relation between IRAS galaxies and the underlying 
mass. For every model, we have quantified the accuracy of the PSCz reconstruction relative to 
the expectation based on mock PSCz catalogs, and we have used this result to classify the model 
as Accepted or Rejected. The Rejected models are unlikely to be the correct models for structure 
formation in the real universe, while for the Accepted models the PSCz reconstruction is more 
accurate than at least one of the model's ten mock catalog reconstructions. We have computed 
mean weighted ranks (Table 2, column 11) as an overall quantitative measure of the accuracy of 
a model's PSCz reconstruction relative to the expectation from mock catalogs. We now examine 
these results in detail to see what general conclusions we can derive regarding the allowed ranges 
of cosmological and galaxy formation parameters. 

Figure 13 shows the locations of all 15 models in the Qm — <^Sm plane. The 12 distinct 
points correspond to the 15 different models because there are sets of models with identical values 
of Vim and (Tgm (and hence b) but with different biasing schemes. Thus, for example, the two 
models 04UNbl.O and 04SQEbl.O are indistinguishable in this plane, as are the three models 
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Table 2: Rank of the PSCz reconstruction with respect to the mock catalogs, for the full suite 
of 15 models. Column 1 gives the name of the model. Column 2 lists the rank of the PSCz 
reconstruction according to the correlation between the true and reconstructed smoothed galaxy 
density fields (r), while the number enclosed in brackets shows the absolute value of the correlation 
coefficient. For each model, columns 3 to 10 list the rank of the PSCz reconstruction with respect 
to the corresponding mock catalogs, for the following statistical properties: the Fourier difference 
statistic D{k), the PDF of the smoothed galaxy density field, the galaxy counts in spheres of radii 
3h^^ Mpc and 8h^^ Mpc, the void probability function (VPF), the nearest neighbor distribution 
(NNBR), the redshift-space correlation function ^{s), and the projected correlation function w{rp). 
Column 11 lists the weighted mean rank for the PSCz reconstruction computed using the procedure 
described in the text. Column 12 lists our qualitative classification, Accepted(A) or Rejected(R), 
as described in the text. In all cases, the rank of the PSCz is the number of mock catalogs (out 
of 10) that are reconstructed more accurately than the model reconstructs PSCz, according to the 
FOM for the corresponding statistic. 
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ElPLbl.3, ElPLMDbl.3, and ElTHbl.3. The circles show the 12 Accepted models, and the 
triangles show the three Rejected models. For the Accepted models, the radius of the circles is 
proportional to (10 — (Rank)), where (Rank) is the mean rank for the PSCz reconstruction of a 
model. Hence, larger circles show models that are more successful in reconstructing the PSCz 
catalog. The shaded region shows the observed rms fluctuation of the IRAS galaxy distribution, 
cJ8g(IRAS) = 0.69 ± 0.04 QFisher et al. 1994|) . 



We plot four different constraints in the 0,m — csm plane that are obtained using independent 
techniques. The solid line in Figure 13 shows the constraint agm^^ = 0.55, required to reproduce 
the observed masses and abundances of rich clusters of galaxies at the present epoch ( [White et al 



19931 ; [Eke et al. 199^ ; [Viana fc Liddle 1996| ). The dotted hue shows the constraint crsm^m^ = 0.85, 



which is implied by the power spectrum of mass density fluctuations estimated from the peculiar 
velocities of galaxies in the SFI catalog ( Freudling et al. 1999[ ). The remaining two constraints 



arise from comparing the IRAS galaxy distribution with the peculiar velocities of galaxies. In 
linear perturbation theory, the mass continuity equation takes the form ( Peebles 1980| ) 



V . V = -fHo6m, (19) 

where / ?» ^^- If we assume that 5g = bs6m (the linear bias model), equation ([T9|) becomes 

V-v = -pHo6g, (20) 

where /3 = Q^/bs- The short dashed line shows the constraint /?iras = 0.5 obtained by the 
VELMOD method, which derives a maximum likelihood estimate of /3iras by comparing the 
peculiar velocities of galaxies in the Mark III catalog with their radial velocities predicted from 
the IRAS 1.2 Jy redshift catalog, using equation (||) ( [Willick et al. 19974 [Wilhck &: Strauss 



1998). This value of /3iras is also obtained from an analysis of the anisotropy of the redshift-space 
power spectrum of IRAS galaxies (Cole, Fisher, &; Weinberg 1995), and from a comparison of 
the spherical harmonics of the peculiar velocity field derived from the Mark III catalog with the 
spherical harmonics of the gravity field derived from the IRAS 1.2 Jy survey (Davis, Nusser, & 
Willick 1996; see Strauss & Willick 1995 and Hamilton 1998 for reviews of other estimates of 
Aras)- We convert an estimate of /3iras into a constraint on a^rn^'^ using the relation 

(^8m^m = ArASC^Sc,, (21) 

where we have assumed that bs = b = crsg/crsm- The long dashed line shows the constraint 
Aras = 0.9 obtained by the POTENT method, which measures /?iras as the slope of the 
regression between the observed galaxy density field from the IRAS 1.2 Jy redshift catalog and the 
mass density field derived from the peculiar velocities of galaxies in the Mark HI catalog, using a 
modified version of equation ([l9[) (Sigad et al. 1998). Each of these four constraints has a quoted 



uncertainty of about 10% — 20%, implying that all of them cannot be consistent with one another. 

Based on the ranks of the 15 models in Table 2, and their locations in Figure 13, we arrive at 
the following conclusions. 
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Fig. 13. — Points show the locations of the 15 models in the $7^ — ^Sm plane, classified as Accepted 
(circles), or Rejected (triangles). The radius of the circles is proportional to (10 — (Rank)), where 
(Rank) is the mean rank for the PSCz reconstruction of a model; larger circles represent more 
successful reconstructions. The lines show constraints in this plane derived from four different 
techniques: (1) asm^^ = 0.55 from the abundance of clusters at z = (solid line), (2) 
Cgm^m^ = 0.85 from a maximum likelihood estimate of the mass power spectrum using peculiar 
velocity data (dotted line), (3) /3iras = 0.5 from the VELMOD analysis of the Mark III catalog 
of peculiar velocities and the IRAS 1.2 Jy galaxy redshift catalog (short dashed line), and (4) 
PiRAS = 0.9 from the POTENT analysis of the Mark III catalog of peculiar velocities and the IRAS 
1.2 Jy galaxy redshift catalog (long dashed line). The shaded region shows the observed fluctuation 
amplitude of the IRAS galaxy distribution, cr8g(IRAS) = 0.69 ± 0.04. 
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(1): Our successful reconstructions of the PSCz catalog, at least for some plausible assumptions 
about the value of Qm and the bias between IRAS galaxies and mass, lends support to 
the hypothesis that LSS originated in the gravitational instability of small amplitude, 
Gaussian primordial mass density fluctuations. While this success does not, by itself, rule 
out non-Gaussian models for primordial fluctuations, it strengthens the viability of Gaussian 
models. Models whose initial conditions have substantially non-Gaussian PDFs generally 
predict quite different properties for LSS ( |Moscardini et al. 1991 ; Weinberg fc Cole 1992 ). 



(2): Unbiased models in which IRAS galaxies trace mass are rejected, for both 0^ = 0.4 and 
Qrn = 1- From Table 2 and the discussion in §5, it is clear that the models ElUNbl.O and 
04UNbl.O are both clearly rejected by the reconstruction analysis of the PSCz catalog. 
Figure 11(a) shows that the model ElUNbl.O fails in the manner expected if we reconstruct 
the redshift-space galaxy distribution in a low $7^ universe using, erroneously, a high value 
of Qm- The high velocity dispersion of clusters in an 0^ = 1 model suppresses the small 
scale correlations in redshift space, while the large scale bulk flows, whose amplitude is 
proportional to fij^^, enhances the correlations on large scales. Therefore, the reconstructed 
^(■s) has a shallower slope compared to the true ^(s), although the rms fluctuations (in 
redshift space) of the two galaxy distributions are similar, by construction. 

(3): Of the five models with fi^ = 1) ElTHbl.3 is the only Accepted model that reconstructs 
the PSCz catalog as well as its own typical mock catalog. While the model ElUNbl.O 
is clearly Rejected, the models ElPLbl.3 and ElPLbl.8 are Accepted because one mock 
catalog in each of these models is reconstructed worse than the PSCz catalog, and the model 
ElPLMDbl.3 is Accepted because there are two mock catalogs that are reconstructed worse 
than the PSCz catalog. Thus, although four of the five models with 0^ = 1 are Accepted, 
most of them are only moderately successful in reconstructing the PSCz catalog. 

(4): In Figure 13, there are five models that lie on, or close to, the constraint /3iras = 0.5. Of 
these, the models 04UNbl.O and 04SQEbl.O are clearly rejected, the models ElPlbl.8 and 
L2PLb0.77 are Accepted because there is one mock catalog that is reconstructed worse 
than the PSCz catalog, while the model 03PLbl.4 is Accepted because there are two mock 
catalogs that are reconstructed worse than the PSCz catalog. All these models are at best 
only moderately successful in reconstructing the PSCz catalog. This leads us to conclude 
that the low-normalization constraint /Siras = 0.5 (corresponding to asm^^ ~ 0.35), 
inferred from the simplest interpretation of the VELMOD ( [Willick fc Strauss 1998| ) and 
redshift-space distortion ( pole et al. 1995 ) analyses of the 1.2Jy redshift survey, is only 



marginally successful in reconstructing the PSCz catalog. Here we have assumed that 
bs = b = (Tsg/<J8m to convert an estimate of /3iras into a constraint on agm^^- While 
this relation is valid in a deterministic, linear bias model, in the case of a more general 
biasing relation between galaxy and mass distributions, the relation between b and bs also 
includes terms arising from the non-linearity and the stochasticity of the biasing relation 
( Dekel Sz Lahav 19991) . We are currently investigating the extent to which the estimates of 
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Aras using different techniques, including POTENT, VELMOD, and the anisotropy of the 
redshift-space power spectrum, are sensitive to the details of the biasing scheme (Berlind, 
Narayanan, & Weinberg, in preparation). 

(5): There are seven models including ElTHbl.3, O2PLb0.5, O4MDb0.7, O4SAb0.9, L3PLb0.62, 
L4PLb0.64, and L5PLb0.54, in which the PSCz catalog is reconstructed better than at least 
three mock catalogs corresponding to that model. These models are the most successful in 
reconstructing the properties of the galaxy distribution in the PSCz catalog. Except for 
the model ElTHbl.3, all these models have Qm < 1 and require that asm > crsg (hence 
b < 1), i.e., that IRAS galaxies be antibiased with respect to the mass distribution on a 
scale of 8h^^ Mpc. However, we are unable to pin down the bias factor more precisely: the 
model O4SAb0.9, with a small antibias, and the model O2PLb0.5, with a large antibias, 
both reconstruct the PSCz catalog very well. Most of the successful models require that 
/SiRAS ^ 0.8 (except the model O4SAb0.9, which requires /3iras ~ 0.7) 

(6): The model O4SAb0.9, in which IRAS galaxies are related to the mass distribution according 
to the predictions of the semi-analytic galaxy formation model, reconstructs the PSCz 
catalog very well. This accurate reconstruction of the PSCz catalog is a non-trivial success 
of the semi-analytic model, since the models 04UNbl.O and 04SQEbl.O, with quite similar 
values of ^m and a^rn but with different biasing relations, are both clearly rejected. This 
sensitivity of the reconstruction to the nature of the biasing relation demonstrates that the 
reconstruction analysis of a galaxy redshift survey can distinguish between different bias 
models, and not just between different values of the bias factor b. 

Conclusions (2) — (6) are based on reconstructing the PSCz catalog under the general 
assumptions that the primordial mass density fluctuations form a Gaussian random field and 
that the bias between IRAS galaxies and mass can be characterized by a local, monotonic 
function. While the assumption of Gaussian initial fluctuations enables us to constrain the nature 
of the biasing relation, we could also use reconstruction analysis in a complementary mode, 
i.e., to test the level of non-Gaussianity of the initial fluctuations, given our current state of 
knowledge of the galaxy formation process. In this regard, the successful reconstruction for a 
model (O4SAb0.9) based on a physically motivated theory of galaxy formation and cosmological 
parameter values supported by independent constraints supports the standard hypothesis that 
primordial fluctuations were not far from Gaussian. 

There are several natural directions for extending this analysis using observational data. In 
this paper, we have compared the properties of the reconstruction to the input PSCz galaxy 
distribution in redshift space alone. However, every model reconstruction predicts both the 
real-space galaxy distribution and the fully non-linear peculiar velocity field at every point within 
the reconstruction volume. We can then compare the velocity field predicted for any model with 



the observed peculiar velocities of galaxies in, say, the Mark HI catalog ( Willick et al. 1997b ) 



or the SFI catalog ( Giovanelli et al. 1997| ). Such a comparison will be more accurate than a 
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comparison involving the velocity field predicted using linear theory. The amplitude and the 
non-linear components of the velocity field serve as good diagnostics of the allowed values of 
Clm and o"8m ( [Narayanan fc Weinberg 1998| ). We can correct for inhomogeneous Malmquist bias 
when comparing the observed density and velocity fields by using the reconstructed line-of-sight 
density and velocity distributions to every galaxy. Alternatively, we can circumvent the effects of 
Malmquist bias by working directly in redshift space itself ( [Strauss fc Willick 1995 ). 



In order to understand the galaxy formation process, it is necessary to study the relative bias 
between different types of galaxies as well as the absolute bias between the galaxy population as 
a whole and the underlying mass. For example, it is now well known that optical galaxies are 
more strongly clustered than IRAS galaxies ( Lahav et al. 199C ; Strauss et al. 199^ ; ^aunders et 



al. 1992[ ; [Peacock fc Dodds 1994| [Willmer et al. 1998| ; [Baker et al. 1998| ). Reconstruction analysis 



of the galaxy distribution in the Optical Redshift Survey (Santiago et al. 1995, 1996), using a 
set of models similar to the ones discussed in this paper, will give us independent constraints on 
^m and (Tgrn and enable us to test whether optical galaxies trace the underlying mass. Since the 
Optical Redshift Survey and PSCz probe similar regions of space, the initial conditions derived 
from reconstruction of the two data should be consistent with each other, and it should be possible 
to reproduce one catalog beginning with the initial conditions derived from the other by changing 
only the biasing model used to select galaxies from the evolved mass distribution. 

Reconstruction analysis is thus a powerful tool to constrain the ranges of allowed values 
of cosmological parameters and the details of the galaxy formation process. For example, we 
can discriminate between models with low and high values of Qm, and between models with 
different values of agm (hence, different values of b). However, if the cosmological parameters and 
the mass power spectrum can be determined precisely using other constraints, such as Type la 
supernovae, cosmic microwave background anisotropics, the Lyman-a forest, or weak lensing, then 
reconstruction analysis can focus on deriving the biasing relation between the different galaxy 
distributions and the underlying mass distribution. Knowledge of these relations will, in turn, 
provide strong tests of numerical and semi-analytic models for galaxy formation. 
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